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(57) Abstract 

The invention relates to a 
protein stabilizing sequence parti- 
cularly useful for stabilization of 
proteolytically sensitive proteins. 
The sequence includes a relatively 
small number of amino acids that 
may be expressed fused with a pro- 
teolytically sensitive protein. The 
most effective stabilization se- 
quences assume a-helix structures 
with a hydrophobic face and a posi- 
tively charged polar face which ap- 
pear to require proper orientation 
with respect to each other. Other 
aspects of the invention include 
cloning vectors incorporating a 
gene sequence encoding the stabili- 
zation polypeptide and production 
of stabilized antigenic proteins. 
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DESCRIPTIOM 

8EOPENCE FOR STABILIZING PROTEINS IN BACTERIA 
PROTEIN STABILIZATION SEQUENCE 

The United States Government may have certain rights 
in the present invention pursuant to the terms of Grant 
No. 86-13978 awarded by the National Science Foundation. 



BACKGROUND OF THE INVENTION 
Field of the Invention 

The invention relates to an amino acid sequence 
useful for stabilization of otherwise unstable proteins, 
particularly proteolytically sensitive proteins. The 
invention also relates to methods of producing stabilized 
proteins by direct attachment of the stabilizing sequence 
or through fusion proteins expressed from recombinant 
host cells. 

Description of Related Art 

One of the major problems in the use and preparation 
of therapeutic and commercial proteins is degradation by 
cell proteases. This may occur in vivo in plasma or 
inside a host cell when recombinant methods of 
polypeptide production are employed. Degradation plays 
an obvious role in the elimination of damaged or abnormal 
proteins but also affects half lives of normal proteins. 
Individual protein turnover rates may vary 100 0 fold, 
depending on environmental and structural factors. 

Two general factors contribute to the half-life of 
any given protein. One factor includes "global" features 
such as large size, hydrophobicity , thermal instability 
and charge characteristics. The other factor is related 
to sequence specific parameters of particular a-amino 
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terminii, asparagine residues and so-called PEST 
sequences. Although correlations have been made between 
particular features and susceptibility to degradation, 
explicit degradation signals and pathways are not well 
defined. It is not even known whether peptide bond 
cleavage or biochemical marking is the primary event 
signaling protein destruction. 

In contrast to degradation factors, there appear to 
be stabilization factors that enhance stability of a 
protein toward proteolysis. It has been suggested that 
both thermodynamic stability and proteolytic 
susceptibility are major determining factors in rate of 
degradation (Parsell and Sauer, 1989). There has been 
some evidence that particular sequences at the N-terminus 
of lambda repressor in Escherichia coli aid in 
maintaining a stable tertiary structure and therefore 
slow proteolytic degradation (Parsell and Sauer, 1989) . 
Some evidence also suggests that the increased stability 
of certain fusion proteins may derive from the ability of 
a stable fusion partner such as protein G or 
^-galactosidase to stabilize the tertiary structure of a 
proteolytically sensitive protein (Hellebust et al . , 
1989) . 

A particular amino acid sequence attached as a 
"tail" to the C-terminal of Arc, a small dimeric 
DNA-binding protein found in bacteriophage P22 , has been 
found to stabilize that protein against proteolysis. The 
primary sequence of the tail consists of 25 amino acids 
which when attached to a heterologous protein, the LP57 
mutant of X-repressor in Escherichia coli, also 
stabilized that protein (Bowie and Sauer, 1989) . 

Nevertheless, up until now, an amino acid sequence 
attachable to a proteolytically sensitive protein has not 
been found which will generally increase stability of 
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these proteins o The value of such a sequence would lie 
in its ability to increase the half-life of commercially 
desired proteins for production in recombinant cell hosts 
and the potential to protect proteinaceous vaccines from 
5 serum protease degradation when injected directly into an 

animal. 



SUMMARY OF THE INVENTION 

10 

The invention generally relates to a protein 
stabilization sequence- The sequence is comprised of 
amino acids that may be attached to proteolytically 
sensitive proteins* 

15 

The stabilization sequence includes a limited number 
of amino acids ranging from about ten to about 5 0 
residues. The amino acids need not be confined to a 
particular type of amino acid; however, the amino acid 
2 0 residues will be such that the secondary and tertiary 

structure assumes the form of an outwardly directed, 
properly aligned hydrophobic face and a positively 
charged polar face. The importance of this structure has 
been shown by substituting two of the amino acids causing 

2 5 the structure to have a twisted form. This form was 

shown to be less stable. The alignment of amino acid 
groups within a particular a-helical form appears to be 
important in order for the amino acid sequence to have 
maximal effect as a stabilization sequence. 

30 

The a-helical structure assumed by the amino acid 
protecting sequence is an important aspect of the 
invention. Thus it is number and type of amino acid 
residues involved, and the structure assumed by the amino 

3 5 acid residues making up the structure. Twisted forms of 

the a-helix structure with the hydrophobic and polar 
faces out of alignment provide less stability to fused 
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heterologous proteins. Thus in preferred embodiments the 
outwardly directed hydrophobic face of the structure is 
in alignment with positively charged face of the a-helix. 

Although it is believed that such stabilization 
structures could be formed with as many as a 100 amino 
acid residues, the amino acid sequence used is preferably 
from about 10 to about 50 acid residues and more 
preferably 29 amino acid residues. A particularly 
preferred amino acid sequence is shown in Figure i. 

Another aspect of the invention is the attachment of 
the protein stabilization sequence to a proteolytically 
sensitive enzyme. Such attachments may be made at the 
time of synthesis of the proteolytically sensitive 
protein of interest or alternatively may be used in in 
vitro situations where it is desired to protect a 
proteolytically sensitive protein from degradation. An 
example might include the use of the protecting sequence 
to protect an antigenic protein that is being used to 
promote antibody formation in vivo. Protection of the 
antigen from serum proteases would provide more effective 
use of expensive antigenic materials that are in short 
supply. 

In a preferred embodiment, the protein stabilization 
sequence is prepared in the form of a cassette in which 
the DNA segment encoding the sequence is incorporated 
into a cloning vector. Proper construction of the vector 
results in a protein of interest being expressed with the 
polypeptide protecting group attached to the protein, 
thus protecting it from protease degradation within the 
host cell in which it is expressed. 

Attachment of the protein stabilization sequence is 
generally to the amino-terminal end of the 
proteolytically sensitive protein or to the carboxy 
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teriainal end. In some cases, for example, exceptionally 
long proteolytically sensitive proteins it may be 
desirable to attach the stabilization sequence to both 
termini. In vitro attachment methods may be readily 
accomplished using chemical attachments well known to 
those of skill in the art. Attachment of such sequences 
in vivo is possible by constructing appropriately 
designed cloning rectors* 

The protein stabilization sequence described has 
general utility and is not limited to stabilization of 
the polymerase sigma factor from which it was originally 
derived. RNA polymerase sigma factors from B. subtilis 
or calx have been prepared as fusion proteins with the 
stabilization factor originally isolated from B. subtilis 
sigma factor. It is envisioned that virtually any DNA 
segment encoding a polypeptide could be constructed in a 
cloning vector or expression vector with the DNA segment 
encoding the stabilization factor sequence. Examples of 
other proteolytically sensitive proteins that might be 
protected with the stabilization sequence include 
proinsulin, epidermal growth factor, inter leukin, 
interferon, somato tropin, insulin-like growth factor, 
phosphatase, immunoglobulin Fv and the like. Moreover 
expression may be from a variety of suitable bacterial 
cells provided that they can be suitably transformed by 
an appropriately constructed cloning vector • Examples of 
suitable host microorganisms include E, coli^ B. 
subtilis , S. tymphyrlum and the like. 

The invention is also envisioned as a method of 
enhancing stability of recombinant proteins. A first 
step would be to obtain a gene segment encoding the 
protein stabilization sequence. Such a sequence is not 
limited to a particular number or type of amino acid 
residues provided that the residues form the appropriate 
structure having the stabilization properties. Thus 
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under certain conditions it may be desirable to encode up 
to 100 amino acids in the sequence. In a further step a 
recombinant vector is prepared. This vector will have a 
gene encoding the protein stabilization sequence selected 
as well as a gene encoding a protein desired to be 
stabilized. Generally^ this is accomplished by preparing 
a cloning site within a plasmid vector that encodes the 
protein stabilization sequence, then inserting into that 
site a gene or a gene segment encoding the protein 
desired to be stabilized. The cloning site may be 
modified to include one or more unique restriction sites 
but in general will be constructed so that insertion of 
the gene of choice will restore a proper reading frame 
between the protein stabilization sequence and the 
cloning site through the desired gene segment. Such a 
recombinant vector may then be used to transform a host 
cell. Transformed host cell colonies will then be 
selected, for example by incorporating selected genes 
within the recombinant vectors such as antibiotic 
resistance genes or alkaline phosphatase expressing genes 
which will elicit positive or negative responses 
depending on whether the desired gene has been cloned 
into the cloning site. Once transformed host cell 
colonies are identified, expression of a heterologous 
protein fused with the af oredescribed protein 
stabilization sequence may be affected. This is 
generally performed using standard culture methods well 
known to those of skill in the art. 

Expression of a wide variety of heterologous 
proteins fused with the protein stabilization sequences 
should be possible. In general proteolytically sensitive 
proteins suitable for production as stable heterologous 
proteins from prokaryotic hosts will be proteins that are 
expressible as active fusion proteins. Such proteins 
include 6-galactosidase, sigma factor and the like. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows an amino acid sequence capable of 
forming a protecting hydrophobic faced a-helix structure 
when attached to a proteolytically sensitive protein. 

Figure 2 shows the nucleic acid sequence encoding a 
29 amino acid protective sequence. 

Figure 3 illustrates generally the a-helix formed by 
amino acid sequences that confer proteolytic stability on 
proteins to which they are N-terminally attached. A 
hydrophobic face and a positively charged polar face are 
aligned on the a-helix. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Materials and Methods 

Bacterial Strains and Plasmids 

B. subtilis strains JH-642 (pheA trpC2) and 1A287 
(rpsl) were obtained from J. Hoch and the Bacillus 
Genetic stock Center (Ohio State University) , 
respectively. SR104 has an internal deletion in sigE. 
It was created by transformation of strain SMY with DNA 
from CMOl (trpC2 pheAl AsigEz term) . The M13 host strains 
CJ2 3'6 (dut ung) and JM103 (dut+ ung+) were obtained from 
v. Deretic (University of Texas Health Science Center at 
San Antonio) . Escherichia coli strain TB-1 was provided 
by Kolodrubetz (University of Texas Health Science 
Center at San Antonio). Plasmids pSR-5 {spoIID: : lacZ) 
(Rong et al . 1986), pSI-1 {pSpac) (Yansura and Henner, 
1984), pGSIIGll (sigE) (Stragier et al . , 1984), and 
pSGMU31 (Errington, 1986) have been described, pJM102, 
obtained from J. Hoch, is pUC18 (Yanisch-Perron et al . , 
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1985) with a chloramphenicol acetyltransf erase (cat: 950 
base pairs [bp]) gene cloned into its unique Ndel site, 
PSGMU3M2 was derived from pSGMU3l by cutting the plasmid 
at its two BamHI sites and single Bgixz site and 
5 relegating the two major fragments. This results in loss 

of a small, nonessential BamHI-Bglll fragment (I50bp) and 
the creation of a single BamHI site in the vector. 
pSGMa31A2 has the BamHI site and the iacZ gene of the 
original plasmid downstream of the plasmid's lac 
10 promoter. pSR22 is pBR322 (Bolivar et al., 1977) and 

PUBIIO (Gryczan et al., 1978) joined at their BamHI sites 
plus a l.l-kbp fragment encoding sigE (Stragier et al. 
1984) . The sigE fragment was cloned into the EcoRI site 
' Of PBR322 by using EcoRI linkers. pSRSl and pSR54 are 
variants of pSR22 with missense mutations in the coding 
sequence of sigE. 



15 



20 



plicTonucleoti df» -DirectPd Mutacrt^n^gT g 



The l.i-kpb PstI fragment of pGSIlGll was cloned 
into M13mpl9 (Yanisch-Perron et al . 1985) , and its 
orientation within the vector was determined by 
restriction endonuclease analysis of RF DNA. To 
25 construct sigEA84, an oligonucleotide (60-mer) that 

carried the sequence 3 0 bases upstream of and including 
the ATG codon (nucleotides 179 to 18 i of the l.i-kpb 
fragment) of p'l and 3 0 bases downstream of the amino 
terminus (nucleotides 266 to 268) was synthesized. The 
3 0 60-mer was hybridized to single-stranded M13 DNA 

containing its complementary sequence within the cloned 
1.1-kpb PstI fragment. The P^'-specific sequence was 
"looped out" as a consequence of using DNA polymerase I 
to synthesize a complement to th4 circular DNA by using 
35 ^=he oligonucleotide as primer. Enrichment for the 

deleted sequence was accomplished by si -nuclease 

treatment of the 8 4 -base loop following the 
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polymerization reaction. The hybridization, 
polymerization, and SI treatments were accomplished as 
described by Adelman et al, (Adelman et al . 1983). 
Following transformation of E. coli JM103, mutagenized 
clones were screened by restriction endonuclease analysis 
for reduction in the size of the cloned PstI fragment 
(i.e., 1,1 kbp to 1.02 kbp) and the loss of the Hlndll 
site originally present in the loop region. 
Approximately 5% of the resulting clones had the desired 
mutation. 

Additional mutant clones sigEASl, sigEAAB , sigEA252, 
and sigEA253 were constructed by using oligonucleotides 
of 30 bases (30-mers) (Kunkel, 1984). M13mpl8 containing 
either the original l.l-kbp sigE fragment for slgEASl and 
sigEA84) or 1.02 -kbp sigEA8 4 fragment (for sigEA252 and 
sigEA253) was plague purified twice on E. coli CJ23 6 (dut 
ung) and grown on this host to incorporate uracil into 
the phage DNA. Following hybridization to a 3 0-mer, 
Sequenase: 2 . 0 (United States Biochemical Corp., 
Cleveland, Ohio) was used to synthesize the complementary 
strand. Selection for the strand polymerized in vitro 
was accomplished by transforming and plating the 
polymerization mixture on E. coli JM103 (dut+ ung+) , 
which selects against the uracil-containing template. 
From 50 to 90% of the clones arising from the reaction 
contained the desired mutations. These were identified 
by size changes in the cloned B. subtilis fragment and by 
DNA sequencing. 

Strain Construction 

The E. coli lac gene was translationally coupled to 
sigE by cloning a 3 4 0-bp Pstl-SauIIIAl fragment of the 
l.l-kbp clones sigE gene into Pstl-BamHI-cut pSGMU31A2. 
This coupled the 5' end of lacZ in frame to sigE at a 
site 165 bp downstream of the secpaence encoding the amino 
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10 



terminus of P^'. in a similar fashion, the 256-bp 
PstT-SauTTZAl fragment from sigEABA was cloned into this 
vector. This construct was formed at the same site 
within sigrE but had only 81 bp of the slgE protein-coding 
region upstream of lacZ. JH642 was transformed with 
PSGMU31A2 containing the coding sequence for either the 
sigE+~ or sigEA84 -encoded amino terminus. Clones in 
which the plasmid had integrated into the B. suhtilis 
chromosome were selected by chloramphenicol resistance 
(2.5 A^g/ml) and screened by Southern blot analysis for 
integration into the sxgE locus with the proper slgE 
allele positioned upstream of lacZ. 

Strains of JH642 in which the wild-type sigE locus 
15 had been replaced by mutant sxgE alleles were constructed 

by congression of the mutant alleles, cloned as a i- to 
l.l-kbp PstI fragment in JM102, with a streptomycin 
resistance (Str') marker from B. subtilxs 1A287. Str' 
colonies were screened for Spo" by failure of the colonies 
to run brown on DS (Schaeffer et al . , 1965) plates after 
24 h at 37»c. Spo- clones were then screened for Cm' to 
identify those in which the mutant allele had entered the 
chromosome by gene replacement rather than integration of 
the entire plasmid. Chromosomal DNA from clones that met 
these criteria were screened by Southern blot analysis 
for the predicted gene replacement. 

Analysis of Extrac ts for P^' and cr^-like Prot-Pir^g 



20 



25 



30 



35 



B. subtilxs cultures were grown in DS mediiam, cells 
were harvested, and protein extracts were prepared as 
described previously (Trempy et al . , 1985), except that 
the ammonium sulfate step was omitted. Protein samples 
(100 ng) (determined by the Coomassie method, Bio-Rad 
Laboratories) were precipitated with 2 volumes of" cold 
ethahol, suspended in sample buffer, and fractionated on 
sodium dodecyl sulf ate-polyacrylamide gels (12% 
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acrylamide) . Following electrophoretic transfer to 
nitrocellulose and blocking of the nitrocellulose with 
Blotto, the protein bands were probed with an anti-P^* /a*" 
monoclonal antibody (Trempy et al . , 1985). Bound 
5 antibody was visualized by using a horseradish 

peroxidase-conjugated goat immunoglobulin against mouse 
immunoglobulin (Hyclone Laboratories, Inc.) or ^^I-rabbit 
anti-mouse immunoglobulin antibody (K. Krolick, 
University of Texas Health Science Center at San 
10 Antonio) . 

j8-Galactosidase Assay s 

JB. subtilis strains carrying either pSR5 or an 
15 integrated pSGMU3lA2 plasmid were grown in DS medium and 

harvested at various times during growth and sporulation. 
Cells were disrupted by passage through a French pressure 
Cell (twice at 20,000 Ib/in^) and analyzed for 
B-galactosidase activity as described by Miller (Miller, 
20 1972) . 

DNA Sequencing 

DNA sequencing was performed by the method of Sanger 
25 et al . (1977) with the Seguenase reagents (U.S. 

Biochemical Corp.) and the protocol provided by the 
manufacturer . 

Construction of the gigrEA84 allele . 

30 

The sigE gene was reconfigured to encode a a^-like 
protein as its direct product. A single-stranded 
oligonucleotide was hybridized to a region within a 
single-stranded copy of this fragment (cloned in 
3 5 . M13mpl9) . This resulted in the looping out of the DNA 
that encodes the precursor element of P^^. A complement 
to the Ml3-containing DNA was synthesized by using the 
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oligonucleotide as a primer. Following second-strand 
synthesis and destruction of the loop by Si nuclease, the 
DWA mixture was introduced into E. coli by 
transformation. Cloned DNAs which both lost the Hindi! 
site that is unique to the precursor portion of the 
sequence and contained a PstI fragment of approximately 
1.020 bp (rather than the parental l.ioo bp) were picked 
as putative sigrE mutants (sigEA84) , expected to contain a 
sequence in which the region encoding the amino 
terminus is positioned immediately downstream of the P^' 
initiation codon and ribosome-binding site. This was 
verified by sequencing the splice regions of 
constructions that were positive by the restriction 
endonuclease criteria. 

The splice region of one of the clones diverged from 
the parental sequence only after the initiating ATG 
codon. At that point, the sequence encoding the amino 
terminus began in frame with the ATG codon. 

The 1.02-kbp Psti fragment carrying the sigEA84 
allele was cloned into unique PstI sites of the plasmid 
vectors PJM102 and pSI-l to form pJMA84 and pSlA84, 
respectively. The PstI site of each of these plasmids 
lay downstream of an IPTG 

(isopropylthiogalactopyranoside) -inducible promoter) Plac 
in PJM102 and Pspac in pSI-i) . Both plasmids replicated 
in E. coli, but only psi-i replicated in B. subtilis. 
The sigEA84 gene was cloned in the proper orientation for 
expression from the inducible promoters as verified by 
restriction endonuclease analysis of the ability of this 
gene to specify a product in E. coli. 
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Production of Sicrma Protein in Escherichia coli 

5 Cultures of E, coli which carried either the mutant 

or wild-type sigE allele under Plac or Pspac control were 
induced with IPTG. The amount of P^^ or ct^A84 present at 2 
h after induction was determined by Western immunoblot 
analysis ♦ An autoradiogram of the results of this 

10 experiment is presented in Fig. 3. Significant amounts 

of protein that reacted with the anti-P^V^r^ antibody were 
present in all of the extracts. The extracts from 
strains carrying the sigEA84 allele synthesized a protein 
(a^A84) with the apparent mobility of a^, while the 

15 wild-type allele specified a protein with the mobility of 

P^** In addition to the a^A84 band, extracts prepared from 
the sigEA84 strains contained one or more prominent 
lower-molecular-weight proteins that reacted with the 
antibody. The lower-molecular-weight proteins varied in 

2 0 abundance in different extract preparations and probably 

represent breakdown products of a^. They were similar in 
size to but of greater intensity than bands when 
analyzing extracts of B. subtilis cells that synthesize 
P^^ and (Jonas' et al . , 1989). Densitometry measurements 

2 5 of radioactively labeled antibody bound to the Western 

blots reyealed that the amount of the antibody fixed to 
the two major bands seen in the sigEA84 allele encodes a 
protein with the apparent molecular weight of but the 
sigEA84 product accumulated to a level only 10 to 2 0% of 

30 that obtained from the wild-type slgE allele in coli. 

EXAMPLE 2 



Zrfect of the siaEA84 allele on E. subtilis 



35 



JH642 was transformed with pJMA84 . Cells that had 
integrated this nonreplicating plasmid into their 
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chromosomes were selected by chloramphenicol resistance. 
Recombination between pJMA84 and its homologous sequences 
on the B. subtilis chromosome could occur either upstream 
or downstream of the A84 deletion. Recombination 
upstream of the deletion would place the mutant allele 
under the control of the spoIIG promoter, allowing it to 
be expressed. A recombination event downstream of the 
mutation would leave a wild-type allele as the expressed 
copy. The A84 deletion lay approximately 200 bp from the 
upstream end of the 1.02-kpb fragment. it was 
anticipated that approximately 20% of the 
chloramphenicol-resistant transformants should have the 
mutant allele as the expressed copy under conditions of 
random recombination. 

Of the 497 transformants receiving pJMA84, 127 (25%) 
were Spo" (i.e., colonies remained white and began to lyse 
after 24 h at 37«»C on DS medium) . In a similar 
experiment in which the transforming DNA was pJMl02 with 
a wild-type sigE allele (PJME-1) , no Spo" transformants 
were observed (o of 463). Southern blot analysis of four 
Spo- and four Spo" clones which had been transformed with 
■ PJMA84 revealed the wild-type sigE allele as part of the 
spoIIG operon in the Spo+ clones and the sigEA84 allele 
as the operon component in the Spo" clones. 

The sigE gene of JH642 was replaced with slgEA84 by 
transforming JH642 (Phe") with SMY chromosomal DNA (Phe+) 
plus plasmid pJMA84. Phe+ transformants were screened 
for Spo- clones. The chromosomal DNAs were then examined 
by Southern blot analysis. These DNAs contained a 
P^tr-generated fragment of 1.03 kbp rather than the 
parental fragment of i.i kpb, which hybridized to a 
slg-E-specific DNA probe. One of these clones (SEA84-1) 
was characterized further. 
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The activity of the in clone c^A84-l was analyzed 
by using a reporter gene (lacZ) fused to a a^-dependent 
promoter (spoIID) (Rong et al • , 1986). SEA84-1 carrying 
a spoIID::lacZ fusion made virtually no /3-galactosidase 
5 during growth or sporulation in DS medium. Western blot 

analysis failed to detect any at times when is 
normally abundant. No sigma-type protein was detected in 
B. subtilis when an attempt was made to visualize its 
product by immunoprecipitation of a radiolabeled product 
10 or induction in B. subtilis from an spac promoter. Both 

of these methods generate readily detectable P^^ and 
from the wild-type slgE allele. 



EXAMPLE 3 

15 

Translational Fusions to sicrE 



A translation fusion between sigB alleles and the E, 
coli la.eZ gene was constructed. 

20 

As described in the Materials and Methods section, a 
340-bp fragment containing upstream DNA and the coding 
sequence for the first 55 amino acids of P^* was cloned 
into a plasmid vector (pSGMU3lA2) so as to create an 

2 5 in-frame fusion between P^* and the E. coli lacZ gene. A 

similar fusion was created by using the analogous DNA 
fragment (256 bp) from sigEA84. The coding sequence for 
the first 2 6 amino acids from the sigEA84 gene was fused 
to lacZ. The vector used was incapable of replication in 

3 0 B. subtilis but carried an antibiotic resistance gene 

(cat) expressible in B. subtilis . Thus, transformation 
of B. subtilis with the fusion plasmids followed by 
selection for chloramphenicol resistance yielded clones 
in which the plasmid was integrated into its chromosomal 
3 5 homolog. Integration of the plasmid resulted in a Spo" 

phenotype owing to the positioning of the sigE::IacZ 
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f US ion gene within the spoIIG operon in place of the 
wild- type allele. 



When the wild-type sxgEzilacZ fusion plasmid was 
introduced into a sigE* B. subtilis strain, all of the 
clones (38 of 38) displayed a Lac+ phenotype on plates 
(Table 1). In a similar experiment, only 25% (8 of 32) 
of the transformants receiving the sigEA84 : :lacZ DNA were 
Lac+. As was the case in the previous transformation 
experiment, the integration event of ::2acZ into sigE 
could take place either upstream or downstream of the A84 
mutation. Recombination downstream of the mutation would 
result in creation of a wild-type sigE;:2acZ gene. It 
was therefore possible that the Lac+ clones represented 
sigE*:'.lacZ fusions and that slgEA84 : :2acZ clones were 
Lac-. A Southern blot analysis of the chromosomes of a 
representative sample of Lac+ and Lac transformants that 
had received sigEA84 : Jiacz DNA was performed. six of six 
Lac-*- clones had a DNA fragment of the size anticipated 
for wild-type sigE upstream of the lacZ gene, and six of 
six of the Lac- clones had a fragment of the size expected 
for £ragEA84 upstream of lacZ. 

To verify that a Lac- phenotype is a consequence of 
the sigrEA84 sequence being upstream of the lacZ gene the 
transformation into SE84-1, a strain which already 
carries the sigEA84 allele was repeated. In these 
experiments, where the creation of a wild-type 
sigEA84': :IacZ fusion was impossible, all of the 
chloramphenicol-resistant transformants that received the 
sigEA84: :IacZ DNA were Lac (25 or 25), while 
approximately 15% (4 of 27) of the transformants which 
received the sigEABA: -.lacZ DNA were Lac". Analysis of the 
level of B-galactosidase in extracts prepared from 
£rigrEA84: :lacZ transformants that were phenotypibally Lac" 
on plates failed to reveal measurable levels of 
B-galactosidase. Thus, synthesis of the fusion protein 
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mirrors the inability of slgEABA to foirm itself. In 
both cases, no product was detectable by the assay 
systems employed. 

5 TABLE 1 

Transformation of Bo subtllis by slgEz zlacZ DNAs^^ 



No. of colonies 

10 Recipient Donor DNA 

fusion Total Spo' Spo' Lac" Lac 

Examined 



JH642 sigEzilacZ 38 38 0 38 0 

15 sigEA84 : :IacZ 32 32 0 8 24 

SE84 sigrE::IacZ 27 27 0 23 4 

sigEA84 : :IacZ 25 25 0 0 25 



" Competent recipient cells were transformed with plasmid 
20 DNA. Transf ormants were selected on LB plates 

supplemented with chloramphenicol (5 /xgml) . Cm' clones 
were replica plated onto DS plates (22) with and 
without an agar overlay containing X-gal 
(5-bromo-4-chloro-3-indolyl-B-/d- 
25 thiogalactopyranoside) (0. 5 mg/ml) (18) . Spo"^ colonies 

turned brown after 2 4 hr at 3 7**C on DS plates, while 
Spo" colonies remained white. Lac"^ colonies turned blue 
on DS with X-gal by 18 hr at 37 «C. 



30 

The cloned sigE gene was mutagenized by using an 
oligonucleotide (3 0-mer) that would generate an altered 
sigEA84 with a Lys codon (AAA) between the initiation 
3 5 codon (AUG) and the codon specifying the amino terminus 

(GGC) . Mutant clones were isolated, and their structures 
were verified by DNA sequencing. 



40 



When introduced into B. subtilis , sigEA81 behaved 
identically to sigEA84. It conferred a Spo" phenotype on 
cells which carried it (i.e., when sigEA84 was cloned 
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into PJM102 and transformed into JH642, approximately 20% 
of the chloramphenicol-resistant transf ormants were Spo ) , 
and cells into which the slgrEASl allele had been 
congxessed synthesized no detectable a^Asl protein. Thus, 
the addition of AAA at position two of sig-EASl yielded no 
measiirable increase in levels. 

During the course of the mutagenesis protocol a sigE 
mutant was generated (sigEAAS) that encoded a protein 
with the four amino acids of the amino terminus of 
£figEA81 joined by an Arg residue to residue 22 of P^^. 
The sigEA48 mutation was initially identified by the 
intermediate size of the Pstl DNA fragment which carried 
it. Its structure was determined by DNA sequencing. 

When sigEA48 was moved into B. subtllis, the cells 
became Spo". However, unlike sigEA84 and sigEASl, sigEA48 
synthesized an active a^-like protein. The sigEA48 
product was seen in Western blots, and its activity was 
evident by the activation of the spoIID promoter. The 
activation of the spolID promoter by sigEA48 does not 
apparently require the processing of a^A4B into the 
wild-type CT^ protein. Transcription of spoIID occurred in 
B. subtilis strains carrying sigEA48 even if the strain 
had a second mutation in a gene (spoliGA) that is 
essential for processing. In addition, no change was 
detected in the mobility of a^A48 at times in sporulation 
(t^) when the processing activity should be present. 
Thus, CT^A48 is both insensitive to processing and active 
without processing. The level of product present in the 
sigEAAS strain was comparable to that found in the 
wild-type sigE strain. 

Extracts were produced from B . subtilis strains lacking 
a chromosomal copy of sigE but containing plasmid-borne 
copies of either the wild-type (sigE+) or one of the 
mutant alleles of sigE (pSRSl or pSR54) . Western blot 
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analysis for proteins of the size of either P^^ or a^-sized 
proteins in this system indicated no detectable proteins 
in the extracts prepared from the strains with the mutant 
sigE alleles. This result was the same obtained with the 
5 slgEA84 and sigEASl alleles. 

EXAMPLE 4 



jg-qalactosidase Fusion Proteins 



10 



B- subtilis strains expressing )3-galactosidase fusion 
proteins were constructed as described (Jonas et al - , 
1990) • The E, coll lacZ gene was translationally coupled 
to sigE by cloning a 340-bp PstlSauIIIAl fragment of the 

15 lol kbp cloned sigE gene into Pstl-BamHI-cut pSGMU31A2. 

This coupled the 5' end of lacZ in frame to sigE at a 
site 165 bp downstream of the sequence encoding the amino 
terminus of P^^- In a similar fashion, the bp segment 
encoding the polypeptide having the amino acid sequence 

2 0 of Figure 1 was cloned into this vector. This construct 

was formed at the same site within sigE but had only the 
base pairs shown in Figure 2 of the sigE protein-coding 
region upstream of lacZ. JH642 was transformed with 
pSGMU3lA2 containing the coding sequence for either the 

25 sigE"^ or slgEAS 4 -encoded amino terminus. Clones in which 

the plasmid had integrated into the B. subtilis 
chromosome were selected by chloroamphenicol resistance 
(2.5 /ig/ml) and screened by Southern blot analysis for 
integration into the sigE locus with the proper sigE 

30 allele positioned upstream of lacZ. 

The half-life of the fusion proteins expressed from 
transformed Escherichia coll or B. subtilis host cells 
were measured. Table 2 shows the results. 



35 
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Protein Microorganism Half Life 



Sigma £ 


E. 


coll 


ND 




iS-galactosidase 


E. 


coll 


30 


min 


/S-galactosidase 


5. 


subtllls 


ND 




Sigma E:SS' 


E. 


coll 


>2 


hr 


/S-galactosidase: SS^ 


E. 


coll 






/S-galactosidasej SS* 


B. 


subtllls 


>2 


hr 



SS is the 29AA sequence shown in Figure 



EXAMPLE 5 

Struct ural Perturbations in Protein Stabilization 
Sequence 

The effect of perturbation of a-helical structure of 
the protein stabilization sequence shown in Figure 1 was 
examined by altering the primary amino acid sequence • 

The DNA sequence shown in Figure 2 was altered by 
insertion of an arginine codon at position 10, resulting 
in a new sequence encoding a 30-amino acid polypeptide - 
In order to effect the insertion, a restriction site was 
created at codon position 11 by substituting codon CTA 
for CTC- The base pair segment coding for the 3 0-amino 
acid polypeptide and a gene segment encoding )S- 
galactosidase were cloned. Expression of iS-galactosidase 
from a B. subtllls bacterial host cell was examined by 
Western blot. Amounts of expressed fusion polypeptide 
were less than measured in systems utilizing the 29-amino 
acid stabilization sequence. 
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The hydrophobic and hyrlrophilic faces of the a-helix 
formed by the 3 0-amino acid polypeptide were twisted 
relative to each other by about 100**. 

Additional insertions into the DNA sequence encoding 
the 30 amino acid polypeptide were made. Codons CTC and 
GCC encoding leucine and alaline were inserted between 
codon 10 and codon 11 of the base sequence, resulting in 
a 180*» twist of the polypeptide a-helix structure as 
compared with the ct-helix of the 29 amino acid 
polypeptide structure. The new 32 amino acid polypeptide 
was expressed as a fusion polypeptide with /3- 
galactosidase from a transformed B. suhtilxs host. 
Expression was less efficient than with fusions formed 
from either the 2 9 or the 3 0 amino acid protein 
stabilizing sequence, as indicated by Western blot 
analysis . 
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CLAIMS 

1. A protein stabilization sequence comprising an amino 
acid sequence attached to a protein to be stabilized, 

5 said sequence including a number of amino acid residues 

sufficient to form a structure having an outwardly- 
directed hydrophobic face and a positively charged polar 
face. 

10 

2. The protein stabilization sequence of claim 1 
wherein the structure is an or-helix. 

15 3. The protein stabilization sequence of claim 1 having 

up to 100 amino acid residues. 

4. The protein stabilization sequence of claim 1 

20 wherein the outwardly directed hydrophobic face is in 

alignment with the positively charged face. 

5. The protein stabilization sequence of claim 1 

2 5 wherein the amino acid sequence is from about 10 to 

about 50 amino acid residues. 

6. ' The protein stabilization sequence of claim 1 

3 0 wherein the amino acid sequence is 29 amino acid 

residues - 

7. The protein stabilization sequence of claim 6 

3 5 wherein the sequence comprises the sequence of Figure 1 

or a biologically functional equivalent thereof. 
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8. A method of stabilizing proteins comprising 
attaching the protein stabilization sequence of claim 1 
to a proteolytically sensitive protein. 



9 . The method of claim 8 wherein the attaching is to an 
amino terminal end of a proteolytically sensitive 
protein. 



10. The method of claim 8 wherein the attaching is to a 
carboxy terminal end of a proteolytically sensitive 
protein . 



11. The method of claim 8 wherein the attaching is to a 
proteolytically sensitive protein at both terminii. 

12. The method of claim 8 wherein the proteolytically 
sensitive protein is RNA polymerase sigma factor. 

13. The method bf claim 12 wherein the RNA polymerase 
Sigma factor is from B. subtilis or E, coli. 

14. A method of enhancing stability of recombinant 
proteins comprising 

obtaining a gene segment encoding the protein 
stabilization sequence of claim 1; 

preparing a recombinant vector having a gene 

encoding the protein stabilization sequence and 
a gene encoding a protein to be stabilized; 
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transforming a host cell with the recombinant 
vector; 

selecting transformed host cell colonies; and 

facilitating expression of a heterologous protein 
fused N- terminally with the protein 
stabilization sequence . 

15. The method of claim 14 wherein the protein to be 
stabilized is /3-galactosidase. 

16. The method of claim 14 wherein the host cell is a 
bacterium. 



17. The method of claim 14 wherein the host cell is B. 
subtilis or E. coll. 



18. A system for production of a stable/ heterologous 
protein in prokaryotic hosts comprising DNA having a 
coding sequence for the protein stabilization sequence of 
claim 1 operably linked to and out of reading frame with 
a terminal leader DNA sequence encoding a protein 
expressible as an active fusion protein. 

19. A DNA segment which codes for the protein 
stabilization sequence of claim 1 or a functionally 
stabilizing equivalent thereof. 
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20 o The DNA segment of claim 19 having the base pair 
sequence of Figure 2 or a functionally stabilizing 
structural equivalent thereof. 

21 o A recombinant DNA vector comprising a first base 
sequence coding for the protein stabilization sequence of 
claim 1 or a second base sequence complementary to the 
first base sequence. 
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